Skip to main content

Crisp repos. Sharp AI. Give AI the blueprint, not the code.

Project description

Cresbee Crespo

License Version Python Downloads Stars

Give AI the blueprint, not the code.

Stop burning your context window on raw source files.
Crespo extracts what matters — and compresses everything else.

pip install crespo && crespo ./myproject

The Problem

You paste a codebase into any LLM. It hits the context limit but you problem remains unfinished. You paste one file at a time. The AI loses the big picture. You give AI the full code. The AI reads 40,000 tokens linearly and still misses the architecture.

Crespo solves this differently.

Instead of concatenating raw files, Crespo uses Tree-sitter AST parsing to extract only the structural DNA of your repository — imports, classes, functions, module connections — and emits a compact XML blueprint. Same architectural understanding. A fraction of the tokens.


Usage

# Structure mode (default)
crespo ./myproject

# Summary mode — requires Groq key
crespo ./myproject --mode summary --groq YOUR_KEY

# Save your Groq key for future runs
crespo --groq YOUR_KEY

# Concat mode — full source, redacted
crespo ./myproject --mode concat

# Analyse a GitHub repo directly
crespo --git https://github.com/user/repo

# Custom output filename
crespo ./myproject --output blueprint.xml

How It Works

your repo
    │
    ▼
 walker          respects .gitignore · skips tests · skips build artifacts
    │
    ▼
 tree-sitter     real AST parse · 10 languages · no regex
    │
    ▼
 extractor       imports · classes · functions · structs · enums
    │
    ▼
 blueprint XML   compact · structured · LLM-ready

No heuristics. No regex scraping. Real language grammars via Tree-sitter — the same parser used by GitHub, Neovim, and Zed.


Modes

Mode What it produces Best for
structure AST skeleton — imports, classes, functions Architecture review, onboarding, LLM context
summary Structure + AI one-line descriptions per file Deeper codebase understanding
concat Full source, secrets redacted, in structured XML Passing entire repos to LLMs safely

Languages supported

Python · JavaScript · TypeScript · JSX · TSX · Rust · Go · Java · C · C++


Example output

<?xml version='1.0' encoding='utf-8'?>
<repo n="kara" s="Gesture-controlled PDF viewer using PyQt6, MediaPipe, and OpenCV.">
  <meta>
    <dep>cv2,mediapipe,numpy,PyQt6,fitz,groq,markdown</dep>
  </meta>
  <files>
    <f p="Ui.py" e=".py" s="Main PyQt6 window coordinating PDF rendering, gesture input, and AI summarisation.">
      <imp>PyQt6,fitz,render,summarise,gesture,markdown</imp>
      <cls n="Window">
        <fn n="summary" p="(self)" />
        <fn n="startGest" p="(self, state)" />
        <fn n="gestZoom" p="(self, state: int)" />
      </cls>
    </f>
    <f p="gesture.py" e=".py" s="MediaPipe hand tracking with gesture classification and debouncing.">
      <imp>mediapipe,cv2,numpy</imp>
      <cls n="GestureController">
        <fn n="detect" p="(self, frame)" />
        <fn n="classify" p="(self, landmarks)" />
      </cls>
    </f>
  </files>
</repo>

Benchmarks

Tested on real open-source repositories. Structure accuracy evaluated by asking an LLM three questions from the blueprint alone — no access to the original source.

Structure mode accuracy

Repo Components & connections Dependencies Entry point Score
Axios ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Express ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Kara ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Moodilist ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Requests ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
Urai ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
FastAPI ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
Flask ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
Average 2.75 / 3

⚠️ Entry points are partially correct on framework-level repos (FastAPI, Flask) and convention-driven repos (Next.js) where the entry point is implicit rather than explicit. Architecture and dependency accuracy remains perfect across all tested repos.

Token reduction — structure mode

Repo Raw tokens Blueprint tokens Reduction
Kara ~4,667 ~934 ~80%
Moodilist ~8,580 ~1,396 ~84%
Axios ~61,494 ~6,989 ~89%
Express ~17,222 ~707 ~96%
FastAPI ~145,606 ~124,993 ~14%
Flask ~77,402 ~13,848 ~82%
Requests ~49,556 ~9,585 ~81%
Urai ~ 17,418 ~2,304 ~87%
Average ~86%

FastAPI (14% reduction) excluded from average — as a framework repo its structure IS the content. Crespo correctly preserves it rather than discarding it.

Framework-heavy repos compress slightly less because the preserved structure is genuinely useful — there is less noise to discard.

Token Counting is done using tiktoken python library.

Compression Depends on Repo Type


Security

Crespo redacts secrets before writing any output. Patterns covered:

  • Quoted assignments — api_key = "...", token: '...'
  • Raw .env style — GROQ_KEY=abc123
  • Known key prefixes — Groq (gsk_), OpenAI (sk-), Anthropic (sk-ant-), GitHub (ghp_), AWS (AKIA), Slack (xox)

Groq Setup

Summary mode uses Groq to generate one-line descriptions per file and function. The free tier is more than enough.

# pass once — saved to ~/.crespo/config
crespo --groq YOUR_KEY

# all future summary runs pick it up automatically
crespo ./myproject --mode summary

Your key is stored locally at ~/.crespo/config and never sent anywhere except Groq's API.


Roadmap

  • Something for Humans coming soon!
  • More aggressive compression preset
  • More language support (Ruby, PHP, Swift, Kotlin)
  • .crespoignore support

Troubleshooting

crespo: command not found

This usually means Crespo was installed successfully, but the executable isn't on your system PATH.

Verify installation

python -m pip show crespo

If Crespo appears in the output but the crespo command still isn't recognized, it's a PATH issue — follow the steps below for your OS.

Windows

Add your Python Scripts directory to PATH (typically):

C:\Users\<you>\AppData\Local\Programs\Python\Python3x\Scripts

Restart your terminal afterwards.

macOS / Linux

Find your user scripts directory:

python3 -m site --user-base

Then add its bin folder to your shell configuration:

export PATH="$HOME/.local/bin:$PATH"

Restart your shell and try again.

Multiple Python installations

If you have multiple Python versions installed, make sure installation and execution use the same interpreter. Check which python/pip you're actually using:

where python      # Windows
which -a python3  # macOS / Linux

Reinstall using that same interpreter explicitly if needed:

python -m pip install --force-reinstall crespo

Recommended: use pipx

For CLI tools, pipx avoids most PATH-related issues entirely:

pipx install crespo
crespo ./myproject

Contributing

Contributions are welcome. If you have ideas for new output modes, better parsing, or additional language support, open an issue or PR.


License

MIT © Hrudul Krishna K V




Built by a CS student who kept hitting AI's free limit.
GitHub · PyPI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crespo-1.0.16.tar.gz (643.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

crespo-1.0.16-py3-none-any.whl (639.7 kB view details)

Uploaded Python 3

File details

Details for the file crespo-1.0.16.tar.gz.

File metadata

  • Download URL: crespo-1.0.16.tar.gz
  • Upload date:
  • Size: 643.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for crespo-1.0.16.tar.gz
Algorithm Hash digest
SHA256 b0dc1ce000e621762a0818392742d6297dea2f867563c7c71269b33a76ec547c
MD5 0412d55f1e6353c6e6ad84926e1ab3d7
BLAKE2b-256 8b43de906e7c30e898f8b85be186cd3d40f56ec7f70023c3801dc354aa994105

See more details on using hashes here.

Provenance

The following attestation bundles were made for crespo-1.0.16.tar.gz:

Publisher: publish.yml on hrudulmmn/crespo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file crespo-1.0.16-py3-none-any.whl.

File metadata

  • Download URL: crespo-1.0.16-py3-none-any.whl
  • Upload date:
  • Size: 639.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for crespo-1.0.16-py3-none-any.whl
Algorithm Hash digest
SHA256 95c8fefd94d07fca05195fa29db30db9c5eb2aec6d356684996393f34387a9e5
MD5 29c18a33d902afd8d7c38290a9a4d82d
BLAKE2b-256 f3b00ad943188023ee2be861d3f34afa9963e71ce39ab27c9a4b4c0b34f08745

See more details on using hashes here.

Provenance

The following attestation bundles were made for crespo-1.0.16-py3-none-any.whl:

Publisher: publish.yml on hrudulmmn/crespo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page